Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision

نویسندگان

چکیده

Recent advances in monocular 3D detection leverage a depth estimation network explicitly as an intermediate stage of the network. Depth map approaches yield more accurate to objects than other methods thanks trained on large-scale dataset. However, can be limited by accuracy map, and sequentially using two separated networks for significantly increases computation cost inference time. In this work, we propose method boost RGB image-based detector jointly training with prediction loss analogous task. way, our supervised supervision from raw LiDAR points, which does not require any human annotation cost, estimate without predicting map. Our novel object-centric focuses around foreground objects, is important object detection, pixel-wise manner. regression model further predict uncertainty represent confidence objects. To effectively train points enable end-to-end training, revisit target design architecture. Extensive experiments KITTI nuScenes benchmarks show that outperform while maintaining real-time speed.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monocular Object Detection Using 3D Geometric Primitives

Multiview object detection methods achieve robustness in adverse imaging conditions by exploiting projective consistency across views. In this paper, we present an algorithm that achieves performance comparable to multiview methods from a single camera by employing geometric primitives as proxies for the true 3D shape of objects, such as pedestrians or vehicles. Our key insight is that for a ca...

متن کامل

Object Localization with Boosting and Weak Supervision for Generic Object Recognition

This paper deals, for the first time, with an analysis of localization capabilities of weakly supervised categorization systems. Most existing categorization approaches have been tested on databases, which (a) either show the object(s) of interest in a very prominent way so that their localization can hardly be judged from these experiments, or (b) at least the learning procedure was done with ...

متن کامل

3D Scene and Object Classification Based on Information Complexity of Depth Data

In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...

متن کامل

3D Object Detection with Kinect

1. Abstract The goal of our project is to develop a general machine learning framework for classifying objects based on RGBD point cloud data from a Kinect. Using this framework, a robot equipped with a Kinect will take the name of an object as input, scan its surroundings, and move to the most likely matching object that it finds. As a proof of concept, we demonstrate our algorithm on an offic...

متن کامل

Sliding Shapes for 3D Object Detection in Depth Images

The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. In this paper, we propose to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, selfocclusion and se...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Intelligent Transportation Systems

سال: 2022

ISSN: ['1558-0016', '1524-9050']

DOI: https://doi.org/10.1109/tits.2022.3224082